**************************************************************************
README for the Replication Archive of "Destruction from Above: Long-Term Legacies of the Tokyo Air Raids"
Masataka Harada, Gaku Ito and Daniel M. Smith
Journal of Politics

**************************************************************************

This archive provides the replication data and code needed to reproduce the tables and figures in the Journal of Politics article "Destruction from Above: Long-Term Legacies of the Tokyo Air Raids." Please cite the article if you use any part of the replication archive.

This README document includes:

1. Contents of replication archive
2. Original data source information 
3. Software dependencies

**************************************************************************
1. CONTENTS OF THE REPLICATION ARCHIVE

In addition to this README.txt file, the replication archive contains the following two folders: 

(1) The "1_Data" folder contains the data files used by the code.
(2) The "2_Code" folder contains the R scripts and the Stata do-file that run the analyses.
* In addition to these files, the datasets for the human coding of the neighborhood-level air raid damage evaluation are available for download from the following URL: https://doi.org/10.7910/DVN/QR2JNC
* The code generates the following three folders: "3_Result", "4_Figures", and "5_Working".
* All files in the "1_Data" folder must be placed directly under the "1_Data" folder. If any files are compressed in Zip format, please unzip them (see (1)(i)).
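
The unzipping step above can also be done from R. The following is a minimal sketch, not part of the original archive: the function name unzip_data_archives is hypothetical, and it assumes the working directory is the root of the replication archive.

```r
# Sketch: unzip every .zip archive found in the data folder so that the
# extracted files sit directly under that folder.
# NOTE: unzip_data_archives() is a hypothetical helper, not an archive file.
unzip_data_archives <- function(data_dir = "1_Data") {
  zips <- list.files(data_dir, pattern = "\\.zip$", full.names = TRUE,
                     ignore.case = TRUE)
  for (z in zips) {
    # junkpaths = TRUE discards any folder structure stored in the archive,
    # so the files land directly under data_dir.
    utils::unzip(z, exdir = data_dir, junkpaths = TRUE)
  }
  invisible(basename(zips))
}

# Run only when the data folder is present in the working directory.
if (dir.exists("1_Data")) unzip_data_archives("1_Data")
```

The function returns (invisibly) the names of the archives it processed, which is an empty vector when there was nothing to unzip.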

The contents of each folder are described in further detail below.
(1) "1_Data" folder contains:
(a) Aerial_Photo_Freq.dta (Stata file containing the dates of the aerial photographs, used to create Figure E1)
(b) ANA_zipcode_data.csv (csv file used to create Table A12)
(c) census_data4.csv (csv file containing the census data)
(d) census_zipcode_data.csv (csv file used to create Figure A9)
(e) eval1.xlsx (Excel file containing the data for the inter-coder reliability test, used to produce Figure H3)
(f) georeffed_red.tiff (GeoTIFF file used to produce Figure H3)
(g) keishicho-tokeisho.xlsx (Excel file containing historical statistics, used to produce Figure I1)
(h) PairwiseDistance_Poly2AP.csv (csv file containing the distance between each neighborhood and its closest and second-closest aiming points)
(i) raid_shape.zip (zip file containing raid_shape.shp, raid_shape.dbf, raid_shape.prj, and raid_shape.shx: the shapefile used to produce Figure H3 and other relevant statistics in Appendix H). Please unzip it and place the four files directly under the "1_Data" folder.
(j) RaidShp_May2020b.rds (R data file in sf format containing the neighborhood-level polygons and information on damage, residential ratio, and prewar population density, used to create some of the figures and to extract the damage for each neighborhood)
(k) sports_club_list.csv (csv file containing the sports club data)
(l) TargetLocationsCombined.rds (R data file containing the coordinates information on the aiming points) 
(m) test_ninka_2.dta (Stata file containing the data on authorized neighborhood associations (ANAs))
(n) test_ninka_sinceFY1993_2.dta (Stata file containing the data on authorized neighborhood associations that obtained their ANA status after 1993)
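
Before running the code, it may help to confirm that every file listed above sits directly under "1_Data". The following is a minimal sketch, not part of the original archive: the names expected_files and missing_data_files are hypothetical, and the list assumes raid_shape.zip has already been unzipped into its four component files.

```r
# File names taken from the list (a)-(n) above, with raid_shape.zip replaced
# by the four files it contains.
# NOTE: expected_files and missing_data_files() are hypothetical names.
expected_files <- c(
  "Aerial_Photo_Freq.dta", "ANA_zipcode_data.csv", "census_data4.csv",
  "census_zipcode_data.csv", "eval1.xlsx", "georeffed_red.tiff",
  "keishicho-tokeisho.xlsx", "PairwiseDistance_Poly2AP.csv",
  "raid_shape.shp", "raid_shape.dbf", "raid_shape.prj", "raid_shape.shx",
  "RaidShp_May2020b.rds", "sports_club_list.csv",
  "TargetLocationsCombined.rds", "test_ninka_2.dta",
  "test_ninka_sinceFY1993_2.dta"
)

# Return the expected files that are NOT found directly under data_dir.
missing_data_files <- function(data_dir = "1_Data", files = expected_files) {
  files[!file.exists(file.path(data_dir, files))]
}
```

Calling missing_data_files("1_Data") from the root of the archive returns an empty character vector when all files are in place.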

(2) "2_Code" folder contains the following files:
(a) 1_PackagesFunctions.R (R script to install required packages)
(b) 2_NinkaRegression_MainTxt&AppxA.R (R script to produce the figures and tables for the analysis with ANA as the outcome variable, which appear in either the main text or Appendix A.)
(c) 3_Census_MainTxt&AppxA.R (R script to produce the figures and tables for the analysis with the census as the outcome variable, which appear in either the main text or Appendix A.)
(d) 4_Maps_MainTxt&AppxA.R (R script to produce the maps, which appear in either the main text or Appendix A.)
(e) 5_Fig8_sequential_g_estimation.R (R script to produce Figure 8 showing the results of sequential g-estimation.)
(f) 6_Appendix_BthruK.R (R script to produce the figures and tables which appear in Appendix B through K.)
(g) 7_fig_e1.do (Stata do-file to produce the original Figure E1. File (f) contains the code to reproduce the same figure in R.) 
(h) 8_Codebook.xlsx (The codebook for each dataset is contained in a separate worksheet.)
(i) 9_Log.pdf (Log file in PDF format that records the results of executing files 1-8 above.)
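
The R scripts above can be run one after another from a single R session. The following is a minimal sketch under two assumptions that this README does not state explicitly: that the numeric prefixes give the intended run order, and that the scripts expect the working directory to be the root of the archive. The names r_scripts and run_scripts are hypothetical.

```r
# R scripts (a)-(f) from the list above, in numeric order.
# NOTE: r_scripts and run_scripts() are hypothetical names.
r_scripts <- c(
  "1_PackagesFunctions.R",
  "2_NinkaRegression_MainTxt&AppxA.R",
  "3_Census_MainTxt&AppxA.R",
  "4_Maps_MainTxt&AppxA.R",
  "5_Fig8_sequential_g_estimation.R",
  "6_Appendix_BthruK.R"
)

# Source each script in turn. source() evaluates in the global environment
# by default, so objects created by an earlier script remain available to
# the later ones.
run_scripts <- function(code_dir = "2_Code", scripts = r_scripts) {
  paths <- file.path(code_dir, scripts)
  stopifnot(all(file.exists(paths)))
  for (p in paths) {
    message("Running ", p)
    source(p)
  }
  invisible(paths)
}
```

Calling run_scripts() stops with an error before running anything if one of the listed scripts cannot be found. The Stata do-file 7_fig_e1.do is run separately in Stata.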


**************************************************************************
2. ORIGINAL DATA SOURCE INFORMATION

(a) Aerial photographs were downloaded from the map and aerial photo viewing service by the Geospatial Information Authority of Japan.
URL: https://mapps.gsi.go.jp
(b) ANA information was obtained from the ward offices in the 23 wards of Tokyo. 
(c) Census variables come from the small area census of Japan.
URL: https://www.e-stat.go.jp/
(d) The maps of the affected areas were digitized by NHK from books published by the Japan Map Center.
URL: https://www.nhk.or.jp/archives/shogenarchives/special/tokyodaikushu/
(e) Prewar demographic statistics aggregated by police jurisdiction are from the annual statistics of the Tokyo Metropolitan Police Department in 1940 and 1944.
(f) Geographic information was obtained from the Ministry of Land, Infrastructure, Transport and Tourism (MLIT) National Land Numerical Information.
URL: http://nlftp.mlit.go.jp/ksj-e/index.html
(g) Sports club data were obtained from Tokyo Metropolitan Government and Tokyo Sports Culture Corporation.
URL: https://club-tokyo-sports.jp/tokyo-chiiki-sports-club/list/
(h) The data on aiming points were obtained from Okuzumi and Hikasa (2005).

**************************************************************************
3. SOFTWARE DEPENDENCIES

All analyses were conducted using R 4.2.2 and Stata 17, and were confirmed to replicate on Windows 10 and 11.
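
To see how the running R version compares with the version used for the analyses, a check along the following lines can be run first. This is a minimal sketch, not part of the original archive: the function name check_r_version is hypothetical, and a different R version does not necessarily mean that replication will fail.

```r
# Compare the running R version with the version reported in this README.
# Returns 0 if they match, 1 if the running version is newer, -1 if older.
# NOTE: check_r_version() is a hypothetical helper, not an archive file.
check_r_version <- function(tested = "4.2.2",
                            current = as.character(getRversion())) {
  cmp <- utils::compareVersion(current, tested)
  if (cmp != 0) {
    message("Running R ", current, "; the analyses were conducted with R ",
            tested, ".")
  }
  invisible(cmp)
}

check_r_version()
```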
